312 research outputs found

    Comparing feature matching for object categorization in video surveillance

    Get PDF
    In this paper we consider an object categorization system using local HMAX features. Two feature matching techniques are compared: the MAX technique, originally proposed in the HMAX framework, and the histogram technique originating from Bag-of-Words literature. We have found that each of these techniques have their own field of operation. The histogram technique clearly outperforms the MAX technique with 5-15% for small dictionaries up to 500-1,000 features, favoring this technique for embedded (surveillance) applications. Additionally, we have evaluated the influence of interest point operators in the system. A first experiment analyzes the effect of dictionary creation and has showed that random dictionaries outperform dictionaries created from Hessian-Laplace points. Secondly, the effect of operators in the dictionary matching stage has been evaluated. Processing all image points outperforms the point selection from the Hessian-Laplace operator

    Reluplex: An Efficient SMT Solver for Verifying Deep Neural Networks

    Full text link
    Deep neural networks have emerged as a widely used and effective means for tackling complex, real-world problems. However, a major obstacle in applying them to safety-critical systems is the great difficulty in providing formal guarantees about their behavior. We present a novel, scalable, and efficient technique for verifying properties of deep neural networks (or providing counter-examples). The technique is based on the simplex method, extended to handle the non-convex Rectified Linear Unit (ReLU) activation function, which is a crucial ingredient in many modern neural networks. The verification procedure tackles neural networks as a whole, without making any simplifying assumptions. We evaluated our technique on a prototype deep neural network implementation of the next-generation airborne collision avoidance system for unmanned aircraft (ACAS Xu). Results show that our technique can successfully prove properties of networks that are an order of magnitude larger than the largest networks verified using existing methods.Comment: This is the extended version of a paper with the same title that appeared at CAV 201

    A visual category filter for Google images

    Get PDF
    We extend the constellation model to include heterogeneous parts which may represent either the appearance or the geometry of a region of the object. The pans and their spatial configuration are learnt simultaneously and automatically, without supervision, from cluttered images. We describe how this model can be employed for ranking the output of an image search engine when searching for object categories. It is shown that visual consistencies in the output images can be identified, and then used to rank the images according to their closeness to the visual object category. Although the proportion of good images may be small, the algorithm is designed to be robust and is capable of learning in either a totally unsupervised manner, or with a very limited amount of supervision. We demonstrate the method on image sets returned by Google's image search for a number of object categories including bottles, camels, cars, horses, tigers and zebras

    Defining the essence of innovation how important terms in promoting of transformation processes in Ukraine

    Get PDF
    Feature hierarchies are essential to many visual object recognition systems and are well motivated by observations in biological systems. The present paper proposes an algorithm to incrementally compute feature hierarchies. The features are represented as estimated densities, using a variant of local soft histograms. The kernel functions used for this estimation in conjunction with their unitary extension establish a tight frame and results from framelet theory apply. Traversing the feature hierarchy requires resampling of the spatial and the feature bins. For the resampling, we derive a multi-resolution scheme for quadratic spline kernels and we derive an optimization algorithm for the upsampling. We complement the theoretic results by some illustrative experiments, consideration of convergence rate and computational efficiency.DIPLECSGARNICSELLII

    Patch-Based Experiments with Object Classification in Video Surveillance

    Get PDF
    We present a patch-based algorithm for the purpose of object classification in video surveillance. Within detected regions-of-interest (ROIs) of moving objects in the scene, a feature vector is calculated based on template matching of a large set of image patches. Instead of matching direct image pixels, we use Gabor-filtered versions of the input image at several scales. This approach has been adopted from recent experiments in generic object-recognition tasks. We present results for a new typical video surveillance dataset containing over 9,000 object images. Furthermore, we compare our system performance with another existing smaller surveillance dataset. We have found that with 50 training samples or higher, our detection rate is on the average above 95%. Because of the inherent scalability of the algorithm, an embedded system implementation is well within reach

    Integrated information increases with fitness in the evolution of animats

    Get PDF
    One of the hallmarks of biological organisms is their ability to integrate disparate information sources to optimize their behavior in complex environments. How this capability can be quantified and related to the functional complexity of an organism remains a challenging problem, in particular since organismal functional complexity is not well-defined. We present here several candidate measures that quantify information and integration, and study their dependence on fitness as an artificial agent ("animat") evolves over thousands of generations to solve a navigation task in a simple, simulated environment. We compare the ability of these measures to predict high fitness with more conventional information-theoretic processing measures. As the animat adapts by increasing its "fit" to the world, information integration and processing increase commensurately along the evolutionary line of descent. We suggest that the correlation of fitness with information integration and with processing measures implies that high fitness requires both information processing as well as integration, but that information integration may be a better measure when the task requires memory. A correlation of measures of information integration (but also information processing) and fitness strongly suggests that these measures reflect the functional complexity of the animat, and that such measures can be used to quantify functional complexity even in the absence of fitness data.Comment: 27 pages, 8 figures, one supplementary figure. Three supplementary video files available on request. Version commensurate with published text in PLoS Comput. Bio

    Natural images from the birthplace of the human eye

    Get PDF
    Here we introduce a database of calibrated natural images publicly available through an easy-to-use web interface. Using a Nikon D70 digital SLR camera, we acquired about 5000 six-megapixel images of Okavango Delta of Botswana, a tropical savanna habitat similar to where the human eye is thought to have evolved. Some sequences of images were captured unsystematically while following a baboon troop, while others were designed to vary a single parameter such as aperture, object distance, time of day or position on the horizon. Images are available in the raw RGB format and in grayscale. Images are also available in units relevant to the physiology of human cone photoreceptors, where pixel values represent the expected number of photoisomerizations per second for cones sensitive to long (L), medium (M) and short (S) wavelengths. This database is distributed under a Creative Commons Attribution-Noncommercial Unported license to facilitate research in computer vision, psychophysics of perception, and visual neuroscience.Comment: Submitted to PLoS ON

    A survey of energy drink consumption patterns among college students

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Energy drink consumption has continued to gain in popularity since the 1997 debut of Red Bull, the current leader in the energy drink market. Although energy drinks are targeted to young adult consumers, there has been little research regarding energy drink consumption patterns among college students in the United States. The purpose of this study was to determine energy drink consumption patterns among college students, prevalence and frequency of energy drink use for six situations, namely for insufficient sleep, to increase energy (in general), while studying, driving long periods of time, drinking with alcohol while partying, and to treat a hangover, and prevalence of adverse side effects and energy drink use dose effects among college energy drink users.</p> <p>Methods</p> <p>Based on the responses from a 32 member college student focus group and a field test, a 19 item survey was used to assess energy drink consumption patterns of 496 randomly surveyed college students attending a state university in the Central Atlantic region of the United States.</p> <p>Results</p> <p>Fifty one percent of participants (<it>n </it>= 253) reported consuming greater than one energy drink each month in an average month for the current semester (defined as energy drink user). The majority of users consumed energy drinks for insufficient sleep (67%), to increase energy (65%), and to drink with alcohol while partying (54%). The majority of users consumed one energy drink to treat most situations although using three or more was a common practice to drink with alcohol while partying (49%). Weekly jolt and crash episodes were experienced by 29% of users, 22% reported ever having headaches, and 19% heart palpitations from consuming energy drinks. There was a significant dose effect only for jolt and crash episodes.</p> <p>Conclusion</p> <p>Using energy drinks is a popular practice among college students for a variety of situations. Although for the majority of situations assessed, users consumed one energy drink with a reported frequency of 1 – 4 days per month, many users consumed three or more when combining with alcohol while partying. Further, side effects from consuming energy drinks are fairly common, and a significant dose effect was found with jolt and crash episodes. Future research should identify if college students recognize the amounts of caffeine that are present in the wide variety of caffeine-containing products that they are consuming, the amounts of caffeine that they are consuming in various situations, and the physical side effects associated with caffeine consumption.</p

    Recurrent Connections Aid Occluded Object Recognition by Discounting Occluders

    Full text link
    Recurrent connections in the visual cortex are thought to aid object recognition when part of the stimulus is occluded. Here we investigate if and how recurrent connections in artificial neural networks similarly aid object recognition. We systematically test and compare architectures comprised of bottom-up (B), lateral (L) and top-down (T) connections. Performance is evaluated on a novel stereoscopic occluded object recognition dataset. The task consists of recognizing one target digit occluded by multiple occluder digits in a pseudo-3D environment. We find that recurrent models perform significantly better than their feedforward counterparts, which were matched in parametric complexity. Furthermore, we analyze how the network's representation of the stimuli evolves over time due to recurrent connections. We show that the recurrent connections tend to move the network's representation of an occluded digit towards its un-occluded version. Our results suggest that both the brain and artificial neural networks can exploit recurrent connectivity to aid occluded object recognition.Comment: 13 pages, 5 figures, accepted at the 28th International Conference on Artificial Neural Networks, published in Springer Lecture Notes in Computer Science vol 1172
    corecore